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I. BOLTZMANN AND ENTROPY 



Since the earliest works Boltzmann aimed at a micro- 
scopic interpretation or "proof" of the newly formulated 
second Law of Thermodynamics and of the associated 
concept of entropy, [1]. At the time, since the works 
of Bernoulli, Avogadro, Herapath, Waterstone, 
Kronig, CLAUSIUS it was well established that there 
should be an identification between absolute temperature 
and average kinetic energy at least in gas theory. 

Boltzmann starts in [1] by stating clearly that in gen- 
eral (i.e. not only for gases) temperature and average ki- 
netic energy must be identified and gives a derivation of 
the second law for isocore (i.e. constant volume) trans- 
formations. This first derivation makes use of a period- 
icity assumption on the motion of (each) gas particle to 
obtain the existence of the time average of the kinetic 
energy and seems to fail if the motions are not periodic. 

Nevertheless Boltzmann insisted in conceiving ape- 
riodic motions as periodic with infinite period: see [1] 
where on p. 30 one finds that "... this explanation 
is nothing else but the mathematical formulation of the 
theorem according to which the paths that do not close 
themselves in any finite time can be regarded as closed in 
an infinite time" . He therefore pursued the implied me- 
chanical proof of Thermodynamics to the extreme conse- 
quences. 

A "proof" of the second law meant to look for prop- 
erties of the trajectories in phase space of a mechanical 
equation of motion, like time averages of suitable observ- 
ables, which could have the interpretation of thermody- 
namic quantities like pressure p, volume V, energy U, 
temperature T, entropy S and be related by the thermo- 
dynamic relations, namely 

ds = du + T pdv (1) 

where dU, dV, dS are the variations of U, V, S when 
the control parameters of the system are infinitesimally 
changed. 

The extreme consequence was the ergodic hypothesis 
which was first mentioned by Boltzmann around 1870, 
see p. 237 in [2], for the internal motion of atoms in 



single molecules of a gas ("it is clear that the various gas 
molecules will go through all possible states of motion" 
which, however, could possibly be understood from the 
context to be different from the ergodic hypothesis, see 
[3], because the molecules undergo from time to time 
collisions). See also p. 96 in [4] and p. xxxvii in the 
recent collection by Flamm [5]. 

Considering a collection of copies of the system alike 
to a large molecule, p. 284 in [6], the same assumption 
became what is often referred as the ergodicity property 
of the entire gas. It implied that, by considering all mo- 
tions periodic, kinetic energy equipartition would follow 
and, better (see p. 287 in [6]), even what we call now 
the microcanonical distribution would follow (as well as 
the canonical distribution) . The hypothesis was taken up 
also by Maxwell (1879), see p. 506 in [3]). [53] 

In this way Boltzmann was able to derive various 
thermodynamic consequences and a proof of 1, see [7], 
and was led to exhibiting a remarkable example of what 
later would be called a "thermodynamic analogy" (Sec. Ill 
of [7] ) . This meant the existence of quantities associated 
with the phase space of a mechanical equation of motion 
(typically defined as time averages over the solutions of 
the equations of motion), which could be given thermo- 
dynamic names like equilibrium state, pressure p, volume 
V , energy U, temperature T, entropy S and be related by 
the thermodynamic relations that are expected to hold 
between the physical quantities bearing the same name, 
namely 1. 

The notion of mechanical thermodynamic analogy was 
formulated and introduced by Helmoltz for general 
systems admitting only periodic motions (called mono- 
cyclic), [8, 9]. The proposal provided a new perspective 
and generated the new guiding idea that the Thermody- 
namic relations would hold in every mechanical system, 
from the small and simple to the large and complex: in 
the first cases the relations would be trivial identities of 
no or little interest, just thermodynamic analogies, but 
in the large systems they would become nontrivial and 
interesting being relations of general validity. In other 
words they would be a kind of symmetry property of 
Hamiltonian Mechanics. 

The case of spatially confined systems with one degree 
of freedom was easy (easier than the example already 
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given in [7]): with all motions periodic, the microscopic 
state was indemnified with the phase space point (p, q) 
representing the full mechanical state of the system, the 
macroscopic state in the corresponding thermodynamic 
analogy was identified with the energy surface H (p, q) = 
■^—p 2 + <Pv(l) — U, where m is the mass and cpy is the 
potential energy which confines the motion in position 
space, i.e. in q, and depends on a parameter V. The 
state is completely determined by two parameters U, V. 

Average kinetic energy T = lim T ^oo - JJ" K(p(t))dt 
is identified with temperature; energy is identified with 
U : then if pressure p is defined as the time average 
linv^oo — i Jq dv<Pv(q(t))dt the quantities T,p become 
functions p = p(U,V),T = T(U,V) of the parameters 
U, V determining the state of the system and the 1 should 
hold. 

Indeed the limits as r — ► oo exist in such a simple case, 
in which all motions are periodic and confined between 
q± = q±(U,V) (where U = ?v{q±)); it is dt = = 

, dq and the period of the oscillations is t — 

^/2(U- Vv (q))/ m 

t (U,V) = 2 f q+ — ^ , hence (p. 127 in [10] 

and Ch. I in [11]), 

T = £ P q + Jy] f y/ZiU-rvb))*!, (2) 

_2_ r1+(U,V) d vV y{q) i 
P~ to U-{U,V) V ^ (U ^ v{q)) d< i 

and it is immediate to check, as in [7], that 1 is fulfilled 
by setting 

S(U,V) = 2 log J H=u pdq = 

= 2 log/£(KT) y/2m(U-<p v (q))dq (3) 

The case of the central motion studied in [7] was another 
instance of monocyclic systems, i.e. systems with only 
periodic motions. 

Then in the fundamental paper [10], following and in- 
spired by the quoted works of Helmoltz, Boltzmann 
was able to achieve what I would call the completion of 
his program of deducing the second law (1) from Mechan- 
ics. If 

(1) the absolute temperature T is identified with the av- 
erage kinetic energy over the periodic motion following 
the initial datum (p, q) of a macroscopic collection of N 
identical particles interacting with a quite arbitrary pair 
interaction, and 

(2) the energy U is H(p, q) sum of kinetic and of a po- 
tential energy, 

(3) the volume V is the volume of the region where the 
positions q are confined (typically by a hard wall poten- 
tial), 

(4) the pressure p is the average force exercized on the 
walls by the colliding particles, 

then, from the assumption that each point would evolve 



periodically visiting every other point on the energy sur- 
face (i.e. assuming that the system could be regarded as 
monocyclic, see [11] Appendix 9.3 for details) it would 
follow that the quantity p could be identified with the 
(—dyipv), time average of — dvfv, an d 1 would follow 
as a heat theorem. The heat theorem would therefore 
be a consequence of the general properties of monocyclic 
systems. 

This led Boltzmann to realize, in the same paper, 
that there were a large number of mechanical models of 
Thermodynamics: the macroscopic states could be iden- 
tified with regions of phase spaces invariant under time 
evolution and their points would contribute to the aver- 
age values of the quantities with thermodynamic inter- 
pretation (i.e. p,V 7 U,T) with a weight (hence a proba- 
bility) also invariant under time evolution. 

Hence imagining the weights as a density function one 
would see the evolution as a motion of phase space points 
leaving the density fixed. Such distributions on phase 
space were called monodic (because they keep their iden- 
tity with time or, as we say, are invariant): and in [10] 
several collections of weights or monodes were introduced: 
today we call them collections of invariant distributions 
on phase space or ensembles. Among the ensembles £, 
i.e. collections of monodes, Boltzmann singled out the 
ensembles called orthodes ("behaving correctly"): they 
were the families of probability distributions depending 
on a few parameters (normally 2 for simple one com- 
ponent systems) such that the corresponding averages 
p, V, U, T, defined in (1-4) above, would vary when the 
parameters were varied causing variations dll, dV of av- 
erage energy and volume in such a way that the r.h.s. 
of 1 would be an exact differential, thereby defining the 
entropy S as a function of state, see [11, 12]. 

The ergodic hypothesis yields the "orthodicity" of the 
ensemble £ that today we call microcanonic (in [10] it 
was named ergode): but ergodicity, i.e. the dynamical 
property that evolution would make every phase space 
point visit every other, was not necessary to the orthod- 
icity proof of the ergode. In fact in [10] the relation 1 is 
proved directly without recourse to dynamical properties 
(as we do today, see [11, 13, 14]); and in the same way 
the orthodicity of the canonical ensemble (called holode 
in [10]) was obtained and shown to generate a Thermo- 
dynamics which is equivalent to the one associated with 
the microcanonical ensemble. [54] 

In the end in [6] and, in final form, in [10] the theory of 
ensembles and of their equivalence was born without need 
of the ergodic property: the still important role of the 
ergodic hypothesis was to guarantee that the quantities 
p, V, U, T, S defined by orthodic averages with respect to 
invariant distributions on phase space had the physical 
meaning implied by their names (this was true for the 
microcanonical ensemble by the ergodic hypothesis, and 
for the other ensembles by the equivalence). 

At the same time entropy had received a full micro- 
scopic interpretation consistent with, but independent of, 
the one arising from the Boltzmann's equation in the rar- 
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efied gases case, which can be seen as a quite independent 
development of Boltzmann's work. Furthermore it be- 
came clear that the entropy could be identified, up to a 
universal proportionality constant ks, with the volume 
of phase space enclosed by the energy surface. 

Unfortunately the paper [10] has been overlooked un- 
til quite recently by many, actually by most, physicists 
possibly because it starts, in discussing the thermody- 
namic analogy, by giving the Saturn rings as an "exam- 
ple" : a brilliant one, certainly but perhaps discouraging 
for the suspicious readers of this deep and original paper 
on Thermodynamics. See p. 242 and p. 368 in [15] for an 
exception, possibly the first. 



II. BOLTZMANN'S DISCRETE VISION OF THE 
ERGODIC PROBLEM 

The ergodic hypothesis could not possibly say that ev- 
ery point of the energy surface in phase space visits in due 
time (the recurrence time) every other, see also p. 505 
and following in [3]. But this statement was attributed 
to Boltzmann and criticized over and over again (even 
by Physicists, including in the influential book, [16], al- 
though enlightened mathematicians could see better, see 
p. 385 in [15]): however for Boltzmann phase space was 
discrete and points in phase space were cells A with fi- 
nite size, that I will call h. And time evolution was a 
permutation of the cells: ergodicity meant therefore that 
the permutation was a one cycle permutation. 

This conception, perfectly meaningful mathematically, 
was apparently completely misunderstood by his critics: 
yet it was clearly stated in one of the replies to Zermelo, 
[17], and in the book on gases, [18], see also [12] and the 
de Courtenay's communication in this Symposium. 

In order to explain how a reversible dynamics could be 
compatible with the irreversibility of macroscopic phe- 
nomena he had, in fact, to estimate the recurrence time. 
This was done by multiplying the typical time over which 
a microscopic event (i.e. a collision) generates a variation 
of the coordinates of an order of magnitude appreciable 
on microscopic scales (i.e. a time interval of <~ 10~ 12 s 
and a coordinate variation of the order of 1°A) times the 
number of cells into which phase space was imagined to 
be subdivided. 

The latter number was obtained by dividing the phase 
space around the energy surface into equal boxes of a size 
h equal to the 3iV-th power of p~3 times \/mkBT with 
p the numerical density and ks Boltzmann's constant 
and T temperature. With the data for H 2 at normal 
conditions in 1cm 3 an ealier estimate of Thomson, [19], 
was rederived (and a recurrence time scale so large that 
it would be immaterial to measure it in seconds or in ages 
of the Universe). 

Of course conceiving phase space as discrete is essen- 
tial to formulate the ergodicity property in an acceptable 
way: it does not, however, make it easier to prove it even 
in the discrete sense just mentioned (nor in the sense 



acquired later when it was formulated mathematically 
for systems with continuous phase space). It is in fact 
very difficult to be a priori sure that the dynamics is 
an evolution which has only one cycle. Actually this is 
very doubtful: as one realizes if one attempts a numeri- 
cal simulation of an equation of motion which generates 
motions which are ergodic in the mathematical sense. 

And the difficulty is already manifest in the simpler 
problem of simulating differential equations in a way 
which rigorously respects the uniqueness theorem. In 
computers the microscopic states are rigorously realized 
as cells (because points are described by integers, so that 
the cells sizes are limited by the precision of hardware 
and software) and phase space is finite. By construc- 
tion simulation programs map a cell into another: but 
it is extremely difficult, and possible only in very special 
cases (among which the only nontrivial that I know is 
[20]) without dedicating an inordinate computing time 
to insure a 1 — 1 correspondence between the cells. 

Nevertheless the idea that phase space is discrete and 
motion is a permutation of its points is very appealing 
because it gives a privileged role to the uniform distri- 
bution on the phase space region in which the motion 
develops (i.e. the energy surface, if the ergodic hypothe- 
sis holds). However it is necessary, for consistency, that 
the phase space cells volume does not change with time, 
see Ch. 1 in [11]: this is a property that holds for Hamil- 
tonian evolutions and therefore allows us to imagine the 
ergodic hypothesis as consistent with the predictions of 
Statistical Mechanics. 



III. BOLTZMANN'S HERITAGE 



The success of the ergodic hypothesis has several as- 
pects. One that will not be considered further is that it 
is not necessary: this is quite clear as in the end we want 
to find the relations between a very limited number of 
observables and we do not need for that an assumption 
which tells us the values of all possible averages, most 
of which concern "wild" observables (like the position of 
a tagged particle). The consequence is that the ergodic 
hypothesis is intended in the sense that confined Hamilto- 
nian systems "can be regarded as ergodic for the purpose 
of studying their equilibrium properties" . 

What is, perhaps, the most interesting aspect of the 
hypothesis is that it can hold for systems of any size and 
lead to relations which are essentially size independent as 
well as model independent and which become interesting 
properties when considered for macroscopic systems. 

Is it possible to follow the same path in studying 
nonequilibrium phenomena? The simplest such phenom- 
ena arise in stationary states of systems subject to the 
action of nonconservative forces and of suitable heat re- 
moving forces (whose function is to forbid indefinite build 
up of energy in the system). 

Such states are realized in Physics with great accuracy 
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for very long times, in most cases longer than the avail- 
able observation times. For instance it is possible to keep 
a current circulating in a wire subject to an electromo- 
tive force for a very long time, provided a suitable cooling 
device is attached to the wire. 

As in equilibrium, the stationary states of a system will 
be described by a collection of probability distributions 
on phase space £, invariant with respect to the dynamics, 
which I call ensemble: the distributions /i in £ will be pa- 
rameterized by a few parameters U, V 1 E\ , Ei , . . . which 
have a physical interpretation of (say) average energy, 
volume, intensity of the nonconservative forces acting on 
the system (that will be called "external parameters"). 
Each distribution /x will describe a macroscopic state in 
which the averages of the observables will be their inte- 
grals with respect to fi. The equations of motion will be 
symbolically written as 

x = f (x) (4) 

and we shall assume that f is smooth, that it depends on 
the external parameters and that the phase space visited 
by trajectories is bounded (at fixed external parameters 
and initial data). 

Since we imagine that the system is subject to noncon- 
servative forces the phase space volume (or any measure 
with density with respect to the volume) will not be pre- 
served by the evolution and the divergence 

( r(x) = -^a Xi / i (x) (5) 

i 

will be different from 0. 

We expect that, in interesting cases, the time average 
<j + of a will be positive: 

a+ d = lim - / a(S t x)dt > 0. (6) 

r^oo T J 

and, with few exceptions, x-independent. 

This means that there cannnot be invariant distribu- 
tions with density with respect to the volume. And the 
problem to find even a single invariant distribution is 
notrivial, except possibly for the ones concentrated on 
periodic orbits. 

The problem can be attacked, possibly, by following 
Boltzmann's view of dynamics as discrete, ("die Zahl 
der lebendigen Kraft ist eine diskrete" , see p. 167 in [21]). 

IV. EXTENDING BOLTZMANN'S ERGODIC 
HYPOTHESIS. 



Consider a generic "chaotic" system described by equa- 
tions like 4 which generate motions confined in phase 
space. Under very general conditions it follows that 



°+ > 0, [22], and we concentrate on the case <r + > 0. 
The suggestion that phase space should be regarded as 
discrete, and motion should simply be a one-cycle per- 
mutation of the "cells" A representing the phase space 
points is still very appealing as it would lead to the unam- 
biguous determination of the invariant distribution \i de- 
scribing the statistical properties of the stationary states. 

In fact this is an assumption implicit in any claim of 
physical relevance of a simulation: as already mentioned 
above, a computer program defines a map on small cells 
in phase space. Already in the case of Hamiltonian sys- 
tems (i.e. in equilibrium theory) a simulation will not 
respect the uniqueness of solutions of the equation of 
motion because the map between the cells will not be 
invertible: it is extremely hard to write a program which 
avoids that two distinct cells are mapped into the same 
cell (see above). 

When er + > so that, in the average, phase space 
volume contracts the uniqueness problem becomes essen- 
tially unsurmountable (and not only in simulations); and 
there will be very many cells that eventually evolve into 
the same cell: thus the evolution will not be a permuta- 
tion of the cells. It will, however, become eventually a 
permutation of a subset of the initial set of cells. This 
reflects the fact that the orbits of the solutions of the dif- 
ferential equation 4 will "cluster" on an attractor which 
is a set of volume. 

The conclusion is that the statistics of the motions 
will still be a well defined probability distribution on 
phase space provided the ergodic hypothesis is extended 
to mean that the permutation of the cells on the attractor 
is a one-cycle permutation: it will be, in this case, still 
the uniform distribution concentrated on the cells lying 
on the attractor. 

This viewpoint unifies the conception of the statis- 
tics of equilibrium and of stationary nonequilibrium: the 
statistics fi of the motions, i.e. the probability distribu- 
tion /i such that, in the continuous version of the models, 

lim - f F(S t x)dt = f F(y)n(dy), (7) 

for all smooth observables F and for all but a set of zero 
volume of points x on phase space, can be considered, 
in equilibrium as well as in stationary non equilibrium 
states, as a probability distribution which is uniform on 
the attractor. 

The key obstacle to the above conception of Statisti- 
cal Mechanics for stationary states is that phase space 
cells cannot be supposed to evolve, under the evolution 
assigned by 4 when a + > 0, keeping a constant volume. 
Therefore regarding evolution as a map between cells of a 
discretized version of phase space contains new sources of 
possible errors. Besides the error that is present in equi- 
librium theory due to the cells deformations which leads 
to violations of the uniqueness, [11], there is an error due 
to their contraction er + > 0. 
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In equilibrium the first error can be reduced by re- 
ducing the cells size and the time intervals at which the 
observations (to be interpolated into the estimate of the 
integral in 7) are taken. This is a nontrivial source of 
errors that can be estimated to be physically acceptable, 
at least for the evaluation of the averages of the few ob- 
servables relevant for Thermodynamics, only in certain 
regions of the phase diagrams, see Ch. I in [11]. But 
at least in such regions the discrete interpretation of the 
ergodic hypothesis leads us to a consistent representa- 
tion of the evolution as a permutation between discrete 
elements of a partition of phase space into small cells. 

Out of equilibrium the further source of discretization 
error due to the actual reduction of phase space volume 
implies that it is not consistent to view the motion as 
a permutation of cells of a discretization of phase space 
into small equal volume elements. 

A possible way out is to restrict attention to systems 
that show strongly chaotic behavior. For instance sys- 
tems which are transitive (i.e. admit a dense orbit) and 
hyperbolic, see [23] for a formal definition, are typically 
chaotic systems which are also quite well understood. 

To enter into some detail it is convenient to look at the 
time evolution by drawing a few surfaces Si, S2, . . . , S s 
transversal to the phase space trajectories, and such that 
the trajectories cross some of the surfaces over and over 
again (i.e. each trajectory crosses the surfaces infinitely 
many times both in the future and in the past). Let 
S = Lij Sj (usually called a "Poincare's section" ) and let 
S be the map which transforms a point £ e S (i.e. on 
one of the surface elements Si, S2, . . . , S s ) into the point 
S£ where the orbit of £ meets again for the first time S 
(i.e. it is again on one of the surface elements defining 
S). 

The points in phase space can therefore be described 
by pairs x = (£, 6) if £ is the point in S last visited by the 
trajectory starting at x and 9 is the time elapsed since 
that moment. 

It is possible to partition S into regions Pi , P2, ■ ■ . , P n 
with the property that the symbolic dynamics histories 
a = {(Ti}°^_ 00 on the sets P„, u = 1, . . . , n, has a Markov 
property, in the sense that 

(1) there is a suitable matrix M CT;Cr > with entries or 1, 
such that if M auai+1 = 1 for all i then there is a unique 
point x such that S l x E P ai : the point x is said to be 
"coded" by the sequence er. And 

(2) calling compatible a sequence a with M ai ^ i+1 = 1 
then for all points x there is at least one compatible se- 
quence er which codes x and for all but a set of zero 
volume relative to S the sequence <x is unique (i.e. much 
as it is the case in the binary representation of real num- 
bers). 

The partition P ll P 2l . . . ,P n is then called a Markov 
partition: since the set of exceptions in the correspon- 
dence x< — >er has zero volume, the volume distribution 
can be represented as a probability distribution fiQ over 
the space of compatible sequences. And the statistics of 
the evolution of data £ chosen at random with respect 



to the distribution fi a , which is the main object of inter- 
est, will therefore be represented also by a S'-invariant 
probability distribution on the space of the compatible 
sequences <x, [23]. 

The sets Pi , P 2 , . . . , P n can be used to represent con- 
veniently the microscopic states of the system: given a 
precison h > it is possible to find Nh such that the sets 



N h 



Pa. 



n s- 

j=-N h 



(8) 



have a diameter < h. Therefore the (nonempty) sets 
A = P a _ N ,...,a N can be conveniently used as "cells" to 
describe the evolution, when the size h is small enough 
for considering acceptable to neglect the variations of the 
(few) interesting observables within the A's. 

The evolution S will stretch A along the unstable 
planes and compress it along the stable ones: it will 



map A = P a 



h >■■■ 



inside the union of the n sets 



U,r-P(T_JV h + l, 

We then imagine that the cell A is filled by smaller 
boxes, that will be called microscopic cells or simply mi- 
crocells, of equal volume, which under the action of S 
are transformed into boxes contained in only one of the 
n sets A (j = P a _ N +1 ,...,& N )(T . The microcells, which in a 
simulation could be identified with the integers defining 
them in the computer memory, should be thought of as 
arranged in layers adjacent to unstable planes of S and 
are mapped into microcells of the corresponding layers in 
the n cells A CT . 

Since the evolution, in the average, contracts phase 
space the layers will merge under the action of S so 
that the number of microcells will initially decrease; but 
eventually in each cell A will survive layers of microcells 
whose collection will be mapped one to one into itself: 
the latter collection of microcells is a representation of 
the attractor within the precision h. This is illustrated 
symbolically in Fig.l. 



Fig.l: The lines symbolize arrays of microcells A: the ones in the 
left drawing are stretched and merged by the time evolution into 
arrays that end up in three new boxes. 

For consistency the number of microcells that is even- 
tually found in each cell A is inversely proportional to the 
expansion rate A e (A) _1 of the surface elements on the 
unstable manifold in A: it will be denoted 7V"A e (A) _1 

The time evolution S can then be represented as a 
permutation of the N~Y1a Ae(A) -1 microcells on the at- 
tractor: 

(1) give a rule to select the A/"A e (A) _1 microcells in 
A = P a -N h ....,a Nh an d to partition them into n groups 
labeled by a — 1, ...,n, each containing a fraction 
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of the microcells selected in A and 



6 



(2) establish a correspondence S between the micro- 
cells in the group labeled a and a subset the ones in 
A = P 

(3) approximate S by replacing it by S. 

Certainly there is a lot of ambiguity in deciding how 
to set up the selection and the correspondence: but for 
the purposes of a description of dynamics with precision 
h the ambiguity has no consequence. Note that in simu- 
lations the microcells selection is implicitly prescribed by 
the program, and certainly changes quite substantially by 
any small change of the program or by a change of the 
computer used. By definition of program the evolution 5* 
is replaced by a map of microcells (in huge number even 
in simple simulations): the map is not invertible but be- 
ing a map of a finite set into itself it is eventually reduced 
to a permutation of a subset of the microcells. 

Transitivity of the compatibility matrix M implies that 
the permutation of the microcells on the attractor can 
be chosen cyclic: therefore the stationary distribution fj, 
will be approximated by the uniform distribution on the 
attractor: this is a picture which seems close to Boltz- 
MANN's conception of discretness and extends the ideas 
behind the ergodic hypothesis to more general dynamical 
systems, [11, 24]. 



V. A BIT OF HISTORY. 



The discovery of the probability distribution fi that de- 
scribes the statistics of the stationary states of dynamical 
systems with confined evolution did not follow the path 
discussed in Sec. 4: of course every theorem is preceded 
by a heuristic intuition and the exact genesis of the ideas 
should be asked to their Authors at least to the ones 
present here. But there is no certainty that they will 
give a faithful account, as it is well known that recollec- 
tion of past events in the human mind, even important 
ones, tends to be modified as years pass and new events 
occur. 

A possible history about Boltzmann's ergodic hy- 
pothesis and theory of ensembles is presented in Sec. 1,2. 

The theory of hyperbolic transitive systems is much 
more recent: they were formally introduced by Anosov 
who proved the stability of the notion under perturba- 
tions: a hyperbolic transitive system remains such if 
slightly perturbed. 

The existence of a well defined statistics for almost all 
initial data was heralded, [25], by the work of Adler 
and Weiss on the area preserving map of the torus 
S((fi, f>2) = (fi +^2,^1 + 2<^2): they define and make 
use of a Markov partition. The idea was independently 
developed by Sinai, [23, 26], who treated the general case 
of an Anosov map, building Markov partitions and using 
them to prove the existence of a privileged distribution 
/U giving the statistics of all initial data but a set of zero 
volume. 



A remarkable property of the distribution fj, emerges 
when it is regarded as a probability distribution on the 
compatible sequences a which code the points x phase 
space. Namely it is a "Gibbs distribution" , in the sense of 
probability theory, with a short range potential: this is, 
essentially, a Markov process with finitely many states, 
i.e. an object that is very well understood, [27]. The 
surprising consequence is that Anosov systems are "com- 
pletely integrable" in the sense that we can compute es- 
sentially everything at least in principle, [23, 28]. They 
become a 'paradigm for chaotic evolutions in the same 
sense in which harmonic oscillators are a paradigm for 
ordered motions. 

Hyperbolicity is a strong property which in practice is 
not satisfied in physical systems. Therefore Bowen, [29], 
and Ruelle, [30, 31], developed a theory for systems 
that are hyperbolic in a much weaker sense: these are 
systems with axiom A attractors. Also for such systems it 
is possible to define a natural distribution that describes 
the statistics of all but a set of zero volume of initial 
data and the distribution can be studied by an extension 
of Sinai's methods. 

The natural distribution has since been called the SRB 
distribution and Ruelle has proposed, in the very simi- 
lar context of turbulence theory, and at least three years 
earlier than it appeared in print [31, 32], that in general 
there should be a unique distribution (or possibly a finite 
number of them) describing the experimental statistics of 
motions: it should be the distribution giving the asymp- 
totic behavior of motions with arbitrary initial data apart 
from a set of zero volume. 



VI. DEVELOPMENTS. 



More recently Cohen and myself, [33] and see also 
[11], proposed cutting a "Gordian node" by an hypothesis 
which extends the viewpoint expressed by Ruelle in [34] 
"... while one would be very happy to prove ergodicity 
because it would justify the use of Gibbs ' microcanonical 
ensemble, real systems perhaps are not ergodic but behave 
nevertheless in much the same way and are well described 
by Gibbs' ensemble...'": 

Chaotic hypothesis: The asymptotic motions of a 
confined chaotic mechanical system develop on an at- 
tracting set on which motion can be considered a mixing 
Anosov flow. 

Of course this applies in particular to Hamiltonian sys- 
tems (where the attracting set is the full energy surface) 
and it implies ergodicity: hence the whole body of equi- 
librium Statistical Mechanics; furthermore it puts on the 
same level equilibrium and non equilibrium. 

This interpretation of Ruelle's ideas, [32, 34], has 
some applications because it implies a formal expression 
of the average values of the observables. Even though the 
expression is not (yet ?) computable in any interesting 
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case it may be useful to establish relations between av- 
erage values. For instance implications of a microscopic 
symmetry on macroscopic observables might be found 
from the formal (even if not practically computable) ex- 
pression of the SRB distribution. 

An example is obtained by considering an Anosov sys- 
tem (J 7 , S) , with T a smooth bounded manifold and S a 
smooth transitive hyperbolic map of T . Let 

a(x) = -log|deta x 5(a;)| (9) 

be the phase space contraction rate and let /i be the SRB 
distribution; suppose 

(a) dissipativity, i.e. <r + = J a(y)fi(dy) > 0, and 

(b) time reversal symmetry in the sense that there is a 
smooth isometry I such that IS = 

Define 

> = \'t^ (10, 

3=0 + 

then the following theorem holds, [33], 

Fluctuation theorem: With respect to the SRB dis- 
tribution the observable p satisfies a large deviation prop- 
erty (see below,) with a rate function C(p) which is ana- 
lytic and convex in an interval (—p*,p*), for a suitable 
p* > 1, where it exhibits the symmetry property 

a-?) = c(p) - p<?+ (ii) 



This means that the probability, with respect to the 
SRB distribution fi of (J 7 , S) , that p is inside an inter- 
val [a, b] C (-p*,p*) is P a ,b with lim r ^oo \ logP ai6 = 
max pe[a>6] ((p). 

Existence and analyticity of ((p) is part of the quoted 
general results of Sinai, while the symmetry 11 was 
pointed out in [33] in an attempt to explain the numerical 
results of an earlier computer experiment [35] . The inter- 
est of the theorem lies in the fact that it is a symmetry 
property: hence it holds without any free parameter. 

The theorem can be extended to mixing Anosov flows, 
[36], and therefore, via the chaotic hypothesis and if 
a(x) is the phase space contraction rate defined in 5, 
it becomes a property of essentially any system which is 
chaotic, dissipative and reversible. 

VII. ENTROPY? 



Interest in the properties of the observable cr(x), 5 for 
flows and 10 for maps, arose in several molecular dynam- 
ics simulations in which it was naturally related to the 
entropy creation rate. 



A natural question is whether a definition of entropy 
can be extended to noncquilibrium stationary states in 
analogy with the corresponding definition for equilibrium 
states (which are a very special case of stationary states) . 

The identification between the SRB distributions and 
distributions giving equal probability to the microcells in 
the attractor shows that it should be possible, at least, 
to define a function which is a Lyapunov function for the 
approach to stationarity: this would be an extension of 
the -ff-theorem of Boltzmann. However equality between 
the H function evaluated in equilibrium states and ther- 
modynamic entropy might be a coincidence, important 
but not extendible to non equilibrium (hence not neces- 
sary). Arguments in this direction can be found in the 
literature, [37, 38], and here the controversial aspects of 
this matter will not be touched, [39]. 

It will be worth however to enter into more details 
about why o~(x) has been called entropy creation rate. 
This is simply because in several experiments it had such 
an interpretation, being the ratio between a quantity that 
could be identified with the work per unit time done by 
the noncoservative forces stirring the system divided by 
a quantity identified with temperature of the thermostat 
providing the forces that extract the energy input from 
the stirring forces. The experiments were simulations 
and from many sides critiques were expressed because the 
interpretation seemed closely tied to the explicit form of 
the thermostats models, often considered "unphysical" . 

Furthermore the explicit dependence on the equations 
of motion makes the identification of cr(x) with the en- 
tropy creation rate quite useless if the aim is to compare 
the theory with experiments different from simulations 
because in real experiments {i.e. on experiments on mat- 
ter distinct from impressive arrays of transistors) there 
usually is no explicit model of thermostat force and it is 
difficult to evaluate o~{x). And it might turn out that the 
identification of a(x) with entropy creation rate is closely 
related to the special models considered. 

A simple, but quite general, model of thermostatted 
system may be useful to show that, while we should ex- 
pect that there is a relation between entropy creation 
rate and phase space contraction, still the two notions 
are quite different. 

The system consists in TV = No particles in a con- 
tainer Co and of A a particles in n containers C a which 
play the role of thermostats: their positions will be de- 
noted X a , a = 0,1, ... ,n, and X = (Xo, Xi, . . . , X„). 
Interactions will be described by a potential energy 

n n 

TT(X)-^[/ Q (X a )+^TT a (X ,X a ) (12) 

a— a— 1 

i.e. thermostats particles only interact indirectly, via the 
system. All masses will be m = 1, for simplicity. 

The particles in Co will also be subject to external, 
possibly nonconservative, forces F(Xo,4>) depending on 
a few strength parameters $ = (E 1 ,E 2 , ■ ■ ■)■ It is con- 
venient to imagine that the force due to the confining 
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potential determining the region C is included in F, so 
that one of the parameters is the volume V = \Cq\. See 
Fig. 2 below. 



Fig. 2 The reservoirs occupy finite regions outside Cq, e.g. sectors 
C a C -R 3 , a — 1 , 2 . . . . Their particles arc constrained to have a 
total kinetic energy K a constant, by suitable forces i? a , so that the 
reservoirs "temperatures" T a , see 14, are well defined. 

The equations of motion will be, assuming the mass 
m = 1, 



X 0J = -a i C/o(X )-Ea^ C/ a( X ! X i ) + F i (13) 

X ai = -d l U a (X a ) - dj[/ a (Xo,Xj) - a a X a 

where the last force term — a a X a is a phenomenologi- 
cal force that implies that the thermostats particles keep 
constant kinetic energies: 



Na 1 q q 

^ = E \ (*?) 2 = \N a k B T a \ N .tf (14) 

where the parameters T a should define the thermostats 
temperatures and a a can, for instance, be defined by 



— Ota = 



ef L a - U a 



3N a k B T a 



(15) 



where L a = — &x_ a W a (X. , X a ) • X a is the work done per 
unit time by the forces that the particles in C exert on 
the particles in C a . 

The exact form of the forces that have to be added in 
order to insure the kinetic energies constancy should not 
really matter, within wide limits. But this is a property 
that is not obvious and which is much debated. The 
above thcrmostatting forces choice is dictated by Gauss' 
least effort principle for the constraints K a = const, see 
appendix 9.4 in [11]: this is a criterion that has been 
adopted in several simulations, [40]. Independently of 
Gauss' principle it is immediate to check that if a a is 
defined by 15 then the kinetic energies K a are strictly 
constants of motion. 

The work L a in 15 will be interpreted as heat Q a ceded, 
per unit time, by the particles in Co to the a-th thermo- 
stat (because the "temperature" of C a remains constant, 
hence the thermostats can be regarded in thermal equilib- 
rium). The entropy creation rate due to heat exchanges 
between the system and the thermostats can, therefore, 
be naturally defined as 



a°(X,X) 



ief \ - 



Qa 

^ k B T a 

a=l 



(16) 



It should be stressed that here no entropy notion is in- 
troduced for the stationary state: only variation of the 
thermostats entropy is considered and it should not be re- 
garded as a new quantity because the thermostats should 
be considered in equilibrium at a fixed temperature. 

The question is whether there is any relation between 
(To and the phase space contraction a of 5. The lat- 
ter can be immediately computed and is (neglecting 
0(min 0> oiV- 1 )) 



a r (X,X) = ^ 



o>0 



3iVa -lQg-Ug 

3iV a k B T a 



E 

a>0 



k B T a 



-17(17) 



where U = X)a>o 3 bat" 1 k^T ■ Hence in this example 
in which the thermostats are "external" to the system 
volume (unlike to what happens in the common examples 
in which they act inside the volume of the system), the 
phase space contraction is not the entropy creation rate, 
[41]. However it differs from the entropy creation rate by 
a total derivative. 

The latter remark implies that if the chaotic hypoth- 
esis is accepted for the system in Fig. 2 then, assum- 
ing U a bounded (for simplicity, see [41, 42] for more 
general cases) it is cr + = (cr ) because the deriva- 
tive U contributes -(U(t) — U(0)) T ^oo> and also the 
observable p, in the continuous time extension of 10, 
[36], has the same rate function as the observable p = 
\ J Q T cro(Stx) dt = \ f Q a(S t x) dt + 0{t- x ). Since the 
equations of motion 13 are time reversible (a rather gen- 
eral property of Gaussian constraints, with I being here 
simply velocity reversal) it follows that the "physical en- 
tropy creation" 16 has a fluctuations rate £(p) satisfying 
the fluctuation relation 11. 

This is relevant because the definition 16 has mean- 
ing independently of the equations of motions and can 
therefore be suitable for experimental tests. [41,43]. The 
above is just a model of thermostats: other interesting 
models have been proposed based on purely Hamiltonian 
interactions at the price of relying on thermostats of in- 
finite size, see [44-46]. 



VIII. EXTENSIONS OF BOLTZMANN'S 
ff-THEOREM 



The above analysis does not require a notion of entropy 
to be defined for stationary states. 

There is, however, another key contribution of Boltz- 
mann to Statistical Mechanics, briefly mentioned above. 
This is the Boltzmann's equation and the relative H- 
theorem, [47]. 
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The theorem has attracted deep interest because of its 
philosophical implications. For our purposes it is impor- 
tant because it provides a theory of approach to equilib- 
rium and therefore it is one of the first results on noncqui- 
librium. 

It is useful to stress that the definition of H is given in 
the context of the approach to equilibrium and Boltz- 
MANN never applied it (nor, perhaps, meant to apply it) 
to the approach to other stationary states and to their 
theory. The equality of the value of H with the ther- 
modynamic entropy when evaluated on the equilibrium 
state raised the hope that it could be possible to define 
entropy for systems out of equilibrium and even if not 
in stationary state. The idea emerged clearly already 
from the foundational papers on the Boltzmann equa- 
tion ("logP was well defined whether or not the system 
is in equilibrium, so that it could serve as a suitable gen- 
eralization of entropy" , p. 82 in [48] and p. 218 in [21]) 
and many attempts can be found in the literature to de- 
fine entropy for systems out of equilibrium in stationary 
states or even in macroscopically evolving states. 

Strictly speaking the implication that can be drawn 
from the works on the Boltzmann's equation is that a 
rarefied gas started in a given configuration evolves in 
time so that the average values of the observables, at 
least of the few of interest, acquire an asymptotic value 
which is the same as the one that can be computed from 
a probability distribution maximizing a function H. 

The acquisition of an asymptotic value by the averages 
of the observables is a property expected to hold also 
when the asymptotic state is a noncquilibrium stationary 
state. And it is natural to think that also in such cases 
there will be a function that approaches monotonically 
an asymptotic value signaling that the few observables 
of interest approach their asynptotic average. 

As remarked above the SRB distribution is a uniform 
distribution over the attractor: therefore it verifies a vari- 
ational property and this can be used to define a Lya- 
punov function that evolves towards a maximum, [37]. 

Let 77 = {(J- Nh , . . . , a Nh ) and H d = \ J2r, ~Vn ( lo SPn + 
log A t (t/ -1 )) where p v denotes the fraction of microcells 
that can be found in the cell A = P v = r\^_ Nh S~ k P nk 
after a time of r units has elapsed starting from an initial 
distribution (typically a uniform distribution over the 
microcells in a single cell A ). This is a quantity that 
tends to a maximum as time evolves (reaching it when 
the p n have the value of the SRB distribution and the 
maximum equals, therefore, the logarithm of the number 
of microcells on the attractor). 

Therefore the quantity H tends in the average to a 
maximum and it can be regarded as an instance of an 
.ff-function. However the maximum depends on the pre- 
cision h of the coarse graining defined by the partition 
of phase space by the cells A. Changing the precison 
several changes occur which have to be examined if a 
meaning other than that of a Lyapunov function has to 
be given to H. The analysis in [37] points out that H 



changes with the precision ft in a trivial way {i.e. by an 
additive constant, independent of the control parameters 
of the system and depending only on the precision h) if 
the SRB state on which it is evaluated is an equilibrium 
state. In the latter case it is proportional to the loga- 
rithm of the phase space volume that can be visited. In 
the noncquilibrium cases however H changes when the 
precision h changes by additive quantities that are not 
just functions of h but depend on thermodynamic quan- 
tities, (like average energy, temperatures, etc. .), [37]. 

This indicates that while not excluding the possibility 
of existence of Lyapunov functions, see [49], indicating 
the approach to equilibrium (within a given precision h) 
the identity of the H function with entropy, i.e. its iden- 
tity with a function of the state parameters of the system, 
is possible only when the state is in an equilibrium state. 
My interpretation of this analysis, based once more on a 
discrete point of view on the problem, is that one should 
not insist in looking for a notion of entropy in systems 
out of equilibrium, [37]. 

If so once again Boltzmann's attitude to consider 
phase space as discrete and in general to deny reality 
to the continua might have led to insights into difficult 
questions. 



IX. CONCLUSION 



Boltzmann's contribution to the theory of ensembles 
and to the mechanical interpretation of heat and Ther- 
modynamics was based on a discrete conception of the 
continuum: his staunch coherence on this view has been 
an essential aspect of the originality of his thought. 

It is in fact a method of investigation which is still 
very fruitful and used in various forms when "cut-offs" or 
"regularizations" are employed in the most diverse fields. 
In my view it has been and still is important in the re- 
cent developments in the theory of nonequilibrium sta- 
tionary states. The Fluctuation theorem and its various 
interpretations, extensions and applications (to Onsagcr 
reciprocity at non zero forcing, to Grcen-Kubo formu- 
lae, to fluid Mechanics, Turbulence and Intermittency, 
see [11, 23, 50]) is, hopefully, only an example. 

It is interesting in this context recall a few quotes from 
Boltzmann 

" Through the symbols manipulations of integral calcu- 
lus, which have become common practice, one can tem- 
porarily forget the need to start from a finite number of 
elements, that is at the basis of the creation of the con- 
cept, but one cannot avoid if; 
see p. 227 in [51], or in the same page: 

"Differential equations require, just as atomism does, 
an initial idea of a large finite number of numerical val- 
ues and points Only afterwards it is maintained 

that the picture never represents phenomena exactly but 
merely approximates them more and more the greater the 
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number of these points and the smaller the distance be- 
tween them. Yet here again it seems to me that so far we 
cannot exclude the possibility that for a certain very large 
number of points the picture will best represent phenom- 
ena and that for greater numbers it will become again 
less accurate, so that atoms do exist in large but finite 
number." 

and, see p. 55 in [51]: 

" This naturally does not exclude that, after we got used 
once and for all to the abstraction of the volume ele- 
ments and of the other symbols [of Calculus] and once 
one has studied the way to operate with them, it could 
look handy and luring, in deriving certain formulae that 
Volkmann calls formulae for the coarse phenomena, to 
forget completely the atomistic significance of such ab- 
stractions. They provide a general model for all cases in 
which one can think to deal with 10 10 or 10 10 elements 
in a cubic millimeter or even with billions of times more; 
hence they are particularly invaluable in the frame of Ge- 
ometry, which must equally well adapt to deal with the 
most diverse physical cases in which the number of the 
elements can be widely different. Often in the use of all 
such models, created in this way, it is necessary to put 
aside the basic concept, from which they have overgrown, 
and perhaps to forget it entirely, at least temporarily. But 
I think that it would be a mistake to think that one could 
become free of it entirely." 

And the principle was really applied not only in the 
conception of the ergodic hypothesis, [2, 6], but also in 
the deduction of the Boltzmann's equation which Boltz- 
MANN felt would be clarified by following discretization 
methods (in energy) inspired by those employed in the 
"elegant solution of the problem of string-vibrations" of 
Lagrange, or in Stefan's study of diffusion or in Rie- 
MANN's theory of mean curvature, [47] and in various 
discussions of the heat theorem, [21]. 

The above conception of the infinitesimal quantities, 
rooted in the early days of Calculus when "dx" was re- 
garded as infinitely small and yet still of finite size (in 
apparent, familiar, logical contradiction), is an impor- 
tant legacy that should not be forgotten in spite of the 
social pressure that induces all of us to identify clarity of 
physical understanding with continuous models of reality. 

Appendix: Temperature and kinetic energy, [3, 15] 

The first attempts at a kinetic explanation of the prop- 
erties of gases came following the experiments by Boyle, 
(1660), on the gas compression laws. The laws estab- 
lished that "air" had elastic properties and that there was 
inverse proportionality between pressure and volume: a 
theory that was considered also by Newton. It was D. 
Bernoulli, (1720), who abandoned the view, espoused 
by Newton, that the atoms were arranged on a kind of 
lattice repelling each other (with a force inversely propor- 
tional to their distances to agree with Boyle's law, but 
extending only to the nearest neighbors). Bernoulli 
imagined the atoms to be free and that pressure was due 



to the collisions with the walls and proportional to the 
square of the average speed proposing that a correct def- 
inition of temperature should be based on this property. 

In 1816 AvOGADRO established that, for rarefied gases, 
the ratio pV/T is proportional to the number of atoms or 
molecules via a universal constant. This was a striking 
result, explaining the anomalies in the earlier theory of 
Dalton and allowing, besides the definition of the Avo- 
gadro 's number, the correct determination of the relative 
molecular and atomic weights. It opcnend the way to 
the definition of absolute temperature, independently of 
the special gas-thermometer employed, and to the prin- 
ciple of energy equipartition and to the later works of 
Waterston, CLAUSIUS, Boltzmann, among others. 

The attempt of Laplace, (1821), proposed an elab- 
orate scheme in which the atoms, still essentially fixed 
in space at average distance r would contain a quantity 
c of caloric and would interact with a short range force 
proportional to the product of their quantity of caloric 
and depending on the distance. Identifying the caloric c 
with the a fixed amount contained in each atom would 
have led to a gas law with p proportional to the square 
of the density p, i.e. to p 2 c 2 : but this was avoided by 
supposing that the amount of caloric c in each molecule 
was determined by an equilibrium between the amount 
of caloric emitted by a molecule and the caloric received 
by it (emitted from the other molecules) which was sup- 
posed to depend only on the temperature, see [3]. 

The theory of Laplace did not sound convincing and 
the work of Bernoulli went unnoticed; the same was 
the fate of the work of Herapath, (1820), who again 
proposed, without knowing Bernoulli's theory, that 
the atoms were free and pressure was due to collisions 
with the walls; however he assumed that pressure was 
proportional to the average momentum rather than ki- 
netic energy obtaining an incorrect definition of absolute 
temperature. In any event his work was rejected by the 
Philosophical transactions of the Royal Society of Lon- 
don and published on the Annals of Philosophy falling 
into oblivion for a while. 

In 1845 Waterstone, unaware of both Bernoulli 
and Herapath but (likely) familiar with Avogadro's 
work, proposed the theory of gases with the correct iden- 
tification of pressure as proportional to the average ki- 
netic energy and the density, introducing also a rather de- 
tailed conception of te interatomic forces taking up ideas 
inspired by Mossotti (who probably had also made 
AvOGADRO and Italian science better known in England 
during his political exile). Unfortunately he submitted 
it to the Philosophical transactions of the Royal Society 
which readily rejected it and remained unpublished, until 
it was rediscovered much later (1892, by Raileigh). 

In the 1840's, through the work of Meyer, Joule, 
Helmoltz and others the energy conservation principle 
was established with the consequent identification of heat 
as a form of energy convertible into mechanical work forc- 
ing (reasonable) physicists to abandon the hypothesis of 
the existence of caloric as a conserved entity. 
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The theory of gases begun to be really accepted with 
the work of Kronig, (1856), who clearly proposed 
identifying temperature with average kinetic energy of 
molecules. His work became well known as it appeared 
to have prompted the publication of Clausius's paper of 
(1857), who had independently reached the same conclu- 
sions and gone much further. Not only Clausius went 
quite far in establishing energy equipartition (completed 
by Maxwell in 1860) but he introduced a basic concept 
of kinetic theory: the mean free path. Thus making clear 
the role of collisions in gas theory: they lead to prediction 
and to a first understanding of the phenomenon of dif- 
fusion, explaining the apparent paradoxes linked to the 
earlier assumptions that in rarefied gases collisions could 
be simply neglected, and also initiate the theory of the 
transport coefficients. 

The latter papers, one century after the too far in ad- 
vance (over his time) work of Bernoulli, gave origin 
to kinetic theory in the sense we intend it still now, and 
stimulated also the related investigations of Maxwell. 



Therefore Maxwell (1859) and a little later Boltz- 
MANN (1866) could start their work taking for granted 
the well established identity between temperature and 
average kinetic energy for gases extending it to hold in 
all systems in equilibrium (rarefied or not). This key 
view was not destined to have a long life: the advent 
of Quantum Mechanics would prove that proportionality 
between average kinetic energy and temperature could 
only be approximate and to hold if quantum corrections 
to Atomic Mechanics were negligible, see [11]. Never- 
theless the identification of temperature and kinetic en- 
ergy plaid (and still plays, whenever quantum effects are 
negligible) an essential role not only in classical Statisti- 
cal Mechanics but also in the discovery of Quantum Me- 
chanics, which was heralded by the failure of the related 
equipartition of energy 

Source of the talk at the Boltzmann's Legacy inter- 
national symposium at ESI, Vienna, 7-9 June, 2006 
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